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Description 

[0001] This application is a continuation in-part of U. 
S. Application No. 10/410,736, "Method and Apparatus 
for Suppressing Wind Noise," filed April 1 0, 2003. The 
disclosure of the above application is incorporated here- 
in by reference. 

BACKGROUND OF THE INVENTION 

1. Technical Field. 

[0002] This invention relates to acoustics, and more 
particularly, to a system that enhances the perceptual 
quality of a processed voice. 

2. Related Art. 

[0003] Many hands-free communication devices ac- 
quire, assimilate, and transfer a voice signal. Voice sig- 
nals pass from one system to another through a com- 
munication medium. In some systems, including some 
used in vehicles, the clarity of the voice signal does not 
depend on the quality of the communication system or 
the quality of the communication medium. When noise 
occurs near a source or a receiver, distortion garbles the 
voice signal, destroys information, and in some instanc- 
es, masks the voice signal so that it is not recognized 
by a listener. 

[0004] Noise, which may be annoying, distracting, or 
results in a loss of information, may come from many 
sources. Within a vehicle, noise may be created by the 
engine, the road, the tires, or by the movement of air. A 
natural or artificial movement of air may be heard across 
a broad frequency range. Continuousfluctuations in am- 
plitude and frequency may make wind noise difficult to 
overcome and degrade the intelligibility of avoicesignal. 
[0005] Many systems attempt to counteract the ef- 
fects of wind noise. Some systems rely on a variety of 
sound-suppressing and dampening materials through- 
out an interior to ensure a quiet and comfortable envi- 
ronment. Other systems attempt to average out varying 
wind-induced pressures that press against a receiver. 
These noise reducers may take many shapes to filter 
out selected pressures making them difficult to design 
to the many interiors of a vehicle. Another problem with 
some speech enhancement systems is that of detecting 
wind noise in a background of a continuous noise. Yet 
another problem with some speech enhancement sys- 
tems is that they do not easily adapt to other communi- 
cation systems that are susceptible to wind noise. 
[0006] Therefore there is a need for a system that 
counteracts wind noise across a varying frequency 
range. 

SUMMARY 

[0007] A voice enhancement logic improves the per- 



ceptual quality of a processed voice. The system learns, 
encodes, and then dampens the noise associated with 
the movement of air from an input signal. The system 
includes a noise detector and a noise attenuator. The 

5 noise detector detects a wind buffet by modeling. The 
noise attenuator then dampens the wind buffet. 

Alternative voice enhancement logic includes time 
frequency transform logic, a background noise estima- 
tor, a wind noise detector, and a wind noise attenuator. 

10 The time frequency transform logic converts a time var- 
ying input signal into a frequency domain output signal. 
The background noise estimator measures the contin- 
uous noise that may accompany the input signal. The 
wind noise detector automatically identifies and models 

15 a wind buffet, which may then be dampened by the wind 
noise attenuator. 

[0008] Other systems, methods, features and advan- 
tages of the invention will be, or will become, apparent 
to one with skill in the art upon examination of the fol- 
20 lowing figures and detailed description. It is intended 
that all such additional systems, methods, features and 
advantages be included within this description, be within 
the scope of the invention, and be protected by the fol- 
lowing claims. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0009] The invention can be better understood with 
reference to the following drawings and description. The 
30 components in the figures are not necessarily to scale, 
emphasis instead being placed upon illustrating the 
principles of the invention. Moreover, in the figures, like 
referenced numerals designate corresponding parts 
throughout the different views. 
35 [0010] Figure 1 is a partial block diagram of voice en- 
hancement logic. 

[001 1] Figure 2 is noise that may be associated with 
wind and other sources in the frequency domain. 
[0012] Figure 3 is a signal-to-noise ratio of the noise 
40 that may be associated with wind and other sources in 
the frequency domain. 

[0013] Figure 4 is a block diagram of the voice en- 
hancement logic of Figure 1 . 

[0014] Figure 5 is a pre-processing system coupled 
45 to the voice enhancement logic of Figure 1 . 

[0015] Figure 6 is an alternative pre-processing sys- 
tem coupled to the voice enhancement logic of Figure 1 . 
[0016] Figure 7 is a block diagram of an alternative 
voice enhancement system. 
50 [0017] Figure 8 is noise that may be associated with 
wind and other sources in the frequency domain. 
[001 8] Figure 9 is a graph of a wind buffet masking a 
portion of a voice signal. 

[001 9] Figure 1 0 is a graph of a processed and recon- 
55 structed voice signal. 

[0020] Figure 1 1 is a flow diagram of a voice enhance- 
ment. 

[0021] Figure 12 is a partial sequence diagram of a 
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voice enhancement. 

[0022] Figure 13 is a partial sequence diagram of a 
voice enhancement. 

[0023] Figure 1 4 is a block diagram of voice enhance- 
ment logic within a vehicle. 

[0024] Figure 1 5 is a block diagram of voice enhance- 
ment logic interfaced to an audio system and/or a com- 
munication system. 

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

[0025] A voice enhancement logic improves the per- 
ceptual quality of a processed voice. The logic may au- 
tomatically learn and encode the shape and form of the 
noise associated with the movement of air in a real or a 
delayed time. By tracking selected attributes, the logic 
may eliminate or dampen wind noise using a limited 
memory that temporarily stores the selected attributes 
of the noise. Alternatively, the logic may also dampen a 
continuous noise and/or the "musical noise," squeaks, 
squawks, chirps, clicks, drips, pops, low frequency 
tones, or other sound artifacts that may be generated 
by some voice enhancement systems. 
[0026] Figure 1 is a partial block diagram of the voice 
enhancement logic 100. The voice enhancement logic 
may encompass hardware or software that is capable 
of running on one or more processors in conjunction with 
one or more operating systems. The highly portable log- 
ic includes a wind noise detector 1 02 and a noise atten- 
uator 104. 

[0027] In Figure 1 the wind noise detector 102 may 
identify and model a noise associated with wind flow 
from the properties of air. While wind noise occurs nat- 
urally or may be artificially generated over a broad fre- 
quency range, the wind noise detector 1 02 is configured 
to detect and model the wind noise that is perceived by 
the ear. The wind noise detector receives incoming 
sound, that in the short term spectra, may be classified 
into three broad categories: (1) unvoiced, which exhibits 
noise-like characteristics that includes the noise asso- 
ciated with wind ; i.e., it may have some spectral shape 
but no harmonic or formant structure; (2) fully voiced, 
which exhibits a regular harmonic structure, or peaks at 
pitch harmonics weighted by the spectral envelope that 
may describe the formant structure, and (3) mixed voice, 
which exhibits a mixture of the above two categories, 
some parts containing noise-like segments, the rest ex- 
hibiting a regular harmonic structure and/or a formant 
structure. 

[0028] The wind noise detector 1 02 may separate the 
noise-like segments from the remaining signal in a real 
or in a delayed time no matter how complex or how loud 
an incoming segment may be. The separated noise-like 
segments are analyzed to detect the occurrence of wind 
noise, and in some instances, the presence of a contin- 
uous underlying noise. When wind noise is detected, the 
spectrum is modeled, and the model is retained in a 



memory. While the wind noise detector 102 may store 
an entire model of a wind noise signal, it also may store 
selected attributes in a memory. 
[0029] To overcome the effects of wind noise, and in 

5 some instances, the underlying continuous noise that 
may include ambient noise, the noise attenuator 104 
substantially removes or dampens the wind noise and/ 
or the continuous noise from the unvoiced and mixed 
voice signals. The voice enhancement logic 1 00 encom- 

10 passes any system that substantially removes or damp- 
ens wind noise. Examples of systems that may dampen 
or remove wind noise include systems that use a signal 
and a noise estimate such as (1) systems which use a 
neural network mapping of a noisy signal and an esti- 

15 mate of the noise to a noise-reduced signal, (2) systems 
which subtract the noise estimate from a noisy-signal, 
(3) systems that use the noisy signal and the noise es- 
timate to select a noise-reduced signal from a code- 
book, (4) systems that in any other way use the noisy 

20 signal and the noise estimate to create a noise-reduced 
signal based on a reconstruction of the masked signal. 
These systems may attenuate wind noise, and in some 
instances, attenuate the continuous noise that may be 
part of the short-term spectra. The noise attenuator 1 04 

25 may also interface or include an optional residual atten- 
uator 106 that removes or dampens artifacts that may 
result in the processed signal. The residual attenuator 
106 may remove the "musical noise," squeaks, 
squawks, chirps, clicks, drips, pops, low frequency 

30 tones, or other sound artifacts. 

[0030] Figure 2 illustrates exemplary noise associat- 
ed with three wind flows. The wind buffets 202, 204, and 
206, which are the events of wind striking a detector 
vary by their level of severity or amplitude. The ampli- 

35 tudes reflect the relative differences in power or intensity 
between the fluctuations of air pressure received across 
an input area of a receiver or a detector. The line under- 
lying the wind buffets illustrates the continuous noise 
208 that is also sensed by the receiver or detector. In a 

40 vehicle, wind buffets may represent the natural flow of 
air through a window, through an open top of a convert- 
ible, through an inlet, or the artificial movement of air 
caused by a fan or a heating, ventilating, and/or air con- 
ditioning system (HVAC). The continuous noise may 

45 represent an ambient noise or a noise associated with 
an engine, a powertrain, a road, tires, or other sounds. 
[0031] In the time and frequency spectral domain, the 
continuous noise 208 and a wind buffet 202 may be cur- 
vilinear. The continuous noise and wind buffet may ap- 
se pear to be formed or characterized by the curved lines 
shown in Figure 2. However when the signal strength 
(in decibels) of the wind buffet (e.g., a wB ) is related to 
the signal strength of a continuous noise (e.g., o CN )) in 
the signal-to-noise ratio (SNR) domain, the wind buffet 

55 202 may be characterized by a linearfunction with a ver- 
tical dimension corresponding to decibels and a hori- 
zontal dimension corresponding to frequency. This re- 
lation may be expressed as: 
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SNR = <r WB -a CN (Equation 1) 

Any method may approximate the linearity of a wind buf- 
fet. In the signal-to-noise domain, an offset or y- intercept 
302 and an x-intercept or pivot point may characterize 
the linear model 302. Alternatively, an x or y-coordinate 
and a slope may model the wind buffet. In Figure 3, the 
linear model 302 descends in a negative slope. 
[0032] Figure 4 is a block diagram of an example wind 
noise detector 102 that may receive or detect an un- 
voiced, fully voiced, or a mixed voice input signal. A re- 
ceived or detected signal is digitized at a predetermined 
frequency. To assure a good quality voice, the voice sig- 
nal is converted to a pulse-code-modulated (PCM) sig- 
nal by an analog-to-digital converter 402 (ADC) having 
any common sample rate. A smooth window 404 is ap- 
plied to a block of data to obtain the windowed signal. 
The complex spectrum for the windowed signal may be 
obtained by means of a fast Fouriertransform (FFT) 406 
that separates the digitized signals into frequency bins, 
with each bin identifying an amplitude and phase across 
a small frequency range. Each frequency bin may then 
be converted into the power-spectral domain 408 and 
logarithmic domain 410 to develop a wind buffet and 
continuous noise estimate. As more windows of sound 
are processed, the wind noise detector 1 02 may derive 
average noise estimates. A time-smoothed or weighted 
average may be used to estimate the wind buffet and 
continuous noise estimates for each frequency bin. 
[0033] To detect a wind buffet, a line may be fitted to 
a selected portion of the low frequency spectrum in the 
SNR domain. Through a regression, a best-fit line may 
measure the severity of the wind noise within a given 
block of data. A high correlation between the best-fit line 
and the low frequency spectrum may identify a wind buf- 
fet. Whether or not a high correlation exists, may depend 
on a desired clarity of a processed voice and the varia- 
tions in frequency and amplitude of the wind buffet. Al- 
ternatively, a wind buffet may be identified when an off- 
set ory-intercept of the best-fit line exceeds a predeter- 
mined threshold (e.g., > 3 dB). 

[0034] To limit a masking of voice, the fitting of the line 
to a suspected wind buffet signal may be constrained 
by rules. Exemplary rules may prevent a calculated off- 
set, slope, or coordinate point in a wind buffet model 
from exceeding an average value. Another rule may pre- 
vent the wind noise detector 1 02 from applying a calcu- 
lated wind buffet correction when a vowel or another 
harmonic structure is detected. A harmonic may be 
identified by its narrow width and its sharp peak, or in 
conjunction with a voice or a pitch detector. If a vowel 
or another harmonic structure is detected, the wind 
noise detector may limit the wind buffet correction to val- 
ues less than or equal to average values. An additional 
rule may allow the average wind buffet model or its at- 
tributes to be updated only during unvoiced segments. 



If a voiced or a mixed voice segment is detected, the 
average wind buffet model or its attributes are not up- 
dated under this rule. If no voice is detected, the wind 
buffet model or each attribute may be updated through 

5 any means, such as through a weighted average or a 
leaky integrator. Many other rules may also be applied 
to the model. The rules may provide asubstantially good 
linear fit to a suspected wind buffet without masking a 
voice segment. 

10 [0035] To overcome the effects of wind noise, a wind 
noise attenuator 104 may substantially remove or 
dampen the wind buffet from the noisy spectrum by any 
method. One method may add the wind buffet model to 
a recorded or modeled continuous noise. In the power 

15 spectrum, the modeled noise may then be subtracted 
from the unmodified spectrum. If an underlying peak or 
valley 902 is masked by a wind buffet 202 as shown in 
Figure 9 or masked by a continuous noise, a conven- 
tional or modified interpolation method may be used to 

20 reconstruct the peak and/or valley as shown in Figure 
1 0. A linear or step-wise interpolator may be used to re- 
construct the missing part of the signal. An inverse FFT 
may then be usedto convertthe signal powerto thetime 
domain, which provides a reconstructed voice signal. 

25 [0036] To minimize the "music noise," squeaks, 
squawks, chirps, clicks, drips, pops, low frequency 
tones, or other sound artifacts that may be generated in 
the low frequency range by some wind noise attenua- 
tors, an optional residual attenuator 106 (shown in Fig- 

30 ure 1) may also condition the voice signal before it is 
converted to the time domain. The residual attenuator 
1 06 may track the power spectrum within a low frequen- 
cy range (e.g., less than about 400 Hz). When a large 
increase in signal power is detected an improvement 

35 may be obtained by limiting or dampening the transmit- 
ted power in the low frequency range to a predetermined 
or calculated threshold. A calculated threshold may be 
equal to, or based on. the average spectral power of that 
same low frequency range at an earlier period in time. 

40 [0037] Further improvements to voice quality may be 
achieved by pre-conditioning the input signal before the 
wind noise detector processes it. One pre-processing 
system may exploit the lag time that a signal may arrive 
at different detectors that are positioned apart as shown 

45 in Figure 5. If multiple detectors or microphones 502 are 
used that convert sound into an electric signal, the pre- 
processing system may include control logic 504 that 
automatically selects the microphone 502 and channel 
that senses the least amount of noise. When another 

50 microphone 502 is selected, the electric signal may be 
combined with the previously generated signal before 
being processed by the wind noise detector 1 02. 
[0038] Alternatively, multiple wind noise detectors 
102 may be used to analyze the input of each of the 

55 microphones 502 as shown in Figure 6. Spectral wind 
buffet estimates may be made on each of the channels. 
A mixing of one or more channels may occur by switch- 
ing between the outputs of the microphones 502. The 
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signals may be evaluated and selected on a frequency- 
by-frequency basis until the frequency of the pivot point 
304 (shown in Figure3) is reached. Alternatively, control 
logic 602 may combine the output signals of multiple 
wind noise detectors 1 02 at a specific frequency or fre- 
quency range through a weighting function. When the 
frequency of the pivot point is exceeded, the process 
may continue or a standard adaptive beam forming 
method may be used. 

[0039] Figure 7 is alternative voice enhancement log- 
ic 700 that also improves the perceptual quality of a 
processed voice. The enhancement is accomplished by 
time-frequency transform logic 702 that digitizes and 
converts a time varying signal to the frequency domain. 
A background noise estimator 704 measures the con- 
tinuous or ambient noise that occurs near a sound 
source or the receiver. The background noise estimator 
704 may comprise a power detector that averages the 
acoustic power in each frequency bin. To prevent biased 
noise estimations at transients, a transient detector 706 
disables the noise estimation process during abnormal 
or unpredictable increases in power. In Figure 7, the 
transient detector 706 disables the background noise 
estimator 704 when an instantaneous background noise 
B(f, i) exceeds an average background noise B (f) Ave 
by more than a selected decibel level 'c. ' This relation- 
ship may be expressed as: 

B(f,i)>B(f) Ave + c (Equation 2) 

[0040] To detect a wind buffet, a wind noise detector 
708 may fit a line to a selected portion of the spectrum 
in the SNR domain. Through a regression, a best-fit line 
may model the severity of the wind noise 202, as shown 
in Figure 8. To limit any masking of voice, the fitting of 
the line to a suspected wind buffet may be constrained 
by the rules described above. A wind buffet may be iden- 
tified when the offset or y-intercept of the line exceeds 
a predetermined threshold or when there is a high cor- 
relation between a fitted line and the noise associated 
with a wind buffet. Whether or not a high correlation ex- 
ists, may depend on a desired clarity of a processed 
voice and the variations in frequency and amplitude of 
the wind buffet. 

[0041] Alternatively, a wind buffet may be identified by 
the analysis of time varying spectral characteristics of 
the input signal that may be graphically displayed on a 
spectrograph. A spectrograph may produce a two di- 
mensional pattern called a spectrogram in which the 
vertical dimensions correspond to frequency and the 
horizontal dimensions correspond to time. 
[0042] A signal discriminator 71 0 may mark the voice 
and noise of the spectrum in real or delayed time. Any 
method may be used to distinguish voice from noise. In 
Figure 7, voiced signals may be identified by (1 ) the nar- 
row widths of their bands or peaks; (2) the resonant 



structure that may be harmonically related; (3) the res- 
onances or broad peaks that correspond to formant fre- 
quencies; (4) characteristics that change relatively slow- 
ly with time; (5) their durations; and when multiple de- 

5 tectors or microphones are used, (6) the correlation of 
the output signals of the detectors or microphones. 
[0043] To overcome the effects of wind noise, a wind 
noise attenuator 712 may dampen or substantially re- 
move the wind buffet from the noisy spectrum by any 

10 method. One method may add the substantially linear 
wind buffet model to a recorded or modeled continuous 
noise. In the power spectrum, the modeled noise may 
then be removed from the unmodified spectrum by the 
means described above. If an underlying peak or valley 

15 902 is masked by a wind buffet 202 as shown in Figure 
9 or masked by a continuous noise, a conventional or 
modified interpolation method may be used to recon- 
struct the peak and/or valley as shown in Figure 10. A 
linear or step-wise interpolator may be used to recon- 

20 struct the missing part of the signal. A time series syn- 
thesizer may then be used to convert the signal power 
to the time domain, which provides a reconstructed 
voice signal. 

[0044] To minimize the "musical noise," squeaks, 

25 squawks, chirps, clicks, drips, pops, low frequency 
tones, or other sound artifacts that may be generated in 
the low frequency range by some wind noise attenua- 
tors, an optional residual attenuator 714 may also be 
used. The residual attenuator 714 may track the power 

30 spectrum within a low frequency range. When a large 
increase in signal power is detected an improvement 
may be obtained by limiting the transmitted power in the 
low frequency range to a predetermined or calculated 
threshold. A calculated threshold may be equal to or 

35 based on the average spectral power of that same low 
frequency range at a period earlier in time. 
[0045] Figure 1 1 is a flow diagram of a voice enhance- 
ment that removes some wind buffets and continuous 
noise to enhance the perceptual quality of a processed 

40 voice. At act 1 1 02 a received or detected signal is digi- 
tized at a predetermined frequency. To assure a good 
quality voice, the voice signal may be converted to a 
PCM signal by an ADC. At act 1 1 04 a complex spectrum 
for the windowed signal may be obtained by means of 

45 an FFT that separates the digitized signals into frequen- 
cy bins, with each bin identifying an amplitude and a 
phase across a small frequency range. 
[0046] At act 1 1 06, a continuous or ambient noise is 
measured. The background noise estimate may com- 

50 prise an average of the acoustic power in each frequen- 
cy bin. To prevent biased noise estimations at tran- 
sients, the noise estimation process may be disabled 
during abnormal or unpredictable increases in power at 
act 1 1 08. The transient detection act 1 1 08 disables the 

55 background noise estimate when an instantaneous 
background noise exceeds an average background 
noise by more than a predetermined decibel level. 
[0047] At act 1110, a wind buffet may be detected 
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when the offset exceeds a predetermined threshold (e. 
g., a threshold > 3 dB) or when a high correlation exits 
between a best-fit line and the low frequency spectrum. 
Alternatively, a wind buffet may be identified by the anal- 
ysis of time varying spectral characteristics of the input 
signal. When a line fitting detection method is used, the 
fitting of the line to the suspected wind buffet signal may 
be constrained by some optional acts. Exemplary op- 
tional acts may prevent a calculated offset, slope, or co- 
ordinate point in a wind buffet model from exceeding an 
average value. Another optional act may prevent the 
wind noise detection method from applying a calculated 
wind buffet correction when a vowel or another harmon- 
ic structure is detected. If a vowel or another harmonic 
structure is detected, the wind noise detection method 
may limit the wind buffet correction to values less than 
or equal to average values. An additional optional act 
may allow the average wind buffet model or attributes 
to be updated only during unvoiced segments. If a 
voiced or mixed voice segment is detected, the average 
wind buffet model or attributes are not updated under 
this act. If no voice is detected, the wind buffet model or 
each attribute may be updated through many means, 
such as through a weighted average or a leaky integra- 
tor. Many other optional acts may also be applied to the 
model. 

[0048] At act 1 1 1 2, a signal analysis may discriminate 
or mark the voice signal from the noise-like segments. 
Voiced signals may be identified by, for example, (1 ) the 
narrow widths of their bands or peaks; (2) the resonant 
structure that may be harmonically related; (3) their har- 
monics that correspond to formant frequencies; (4) 
characteristics that change relatively slowly with time; 
(5) their durations; and when multiple detectors or mi- 
crophones are used, (6) the correlation of the outputsig- 
nals of the detectors or microphones. 
[0049] To overcome the effects of wind noise, a wind 
noise is substantially removed or dampened from the 
noisy spectrum by any act. One exemplary act 1114 
adds the substantially linear wind buffet model to a re- 
corded or modeled continuous noise. In the power spec- 
trum, the modeled noise may then be substantially re- 
moved from the unmodified spectrum by the methods 
and systems described above. If an underlying peak or 
valley 902 is masked by a wind buffet 202 as shown in 
Figure 9 or masked by a continuous noise, a conven- 
tional or modified interpolation method may be used to 
reconstruct the peak and/or valley at act 1116. A time 
series synthesis may then be used to convert the signal 
power to the time domain at act 1 1 20, wh ich provides a 
reconstructed voice signal. 

[0050] To minimize the "musical noise," squeaks, 
squawks, chirps, clicks, drips, pops, low frequency 
tones, or other sound artifacts that may be generated in 
the low frequency range by some wind noise processes, 
a residual attenuation method may also be performed 
before the signal is converted back to the time domain. 
An optional residual attenuation method 1118 may track 



the power spectrum within a lowfrequency range. When 
a large increase in signal power is detected an improve- 
ment may be obtained by limiting the transmitted power 
in the low frequency range to a predetermined or calcu- 

5 lated threshold. A calculated threshold may be equal to 
or based on the average spectral power of that same 
low frequency range at a period earlier in time. 
[0051] Figures 12 and 13 are partial sequence dia- 
grams of a voice enhancement. Like the method shown 

10 in Figure 11 , the sequence diagrams may be encoded 
in a signal bearing medium, a computer readable medi- 
um such as a memory, programmed within a device 
such as one or more integrated circuits, or processed 
by a controller or a computer. If the methods are per- 

15 formed by software, the software may reside in a mem- 
ory resident to or interfaced to the wind noise detector 
102, a communication interface, or any other type of 
non-volatile or volatile memory interfaced or resident to 
the voice enhancement logic 1 00 or 700. The memory 

20 may include an ordered listing of executable instructions 
for implementing logical functions. A logical function 
may be implemented through digital circuitry, through 
source code, through analog circuitry, or through an an- 
alog source such through an analog electrical, audio, or 

25 video signal. The software may be embodied in any 
computer-readable or signal-bearing medium, for use 
by, or in connection with an instruction executable sys- 
tem, apparatus, or device. Such a system may include 
a computer-based system, a processor-containing sys- 

30 tern, or another system that may selectively fetch in- 
structions from an instruction executable system, appa- 
ratus, or device that may also execute instructions. 
[0052] A "computer-readable medium," "machine- 
readable medium," "propagated-signal" medium, and/or 

35 "signal-bearing medium" may comprise any means that 
contains, stores, communicates, propagates, or trans- 
ports software for use by or in connection with an in- 
struction executable system, apparatus, or device. The 
machine-readable medium may selectively be, but not 

40 limited to, an electronic, magnetic, optical, electromag- 
netic, infrared, or semiconductor system, apparatus, de- 
vice, or propagation medium. A non-exhaustive list of 
examples of a machine-readable medium would in- 
clude: an electrical connection "electronic" having one 

45 or more wires, a portable magnetic or optical disk, a vol- 
atile memory such as a Random Access Memory "RAM" 
(electronic), a Read-Only Memory "ROM" (electronic), 
an Erasable Programmable Read-Only Memory 
(EPROM or Flash memory) (electronic), or an optical f ib- 

50 er (optical). A machine-readable medium may also in- 
clude a tangible medium upon which software is printed, 
as the software may be electronically stored as an im- 
age or in anotherformat (e.g., through an optical scan), 
then compiled, and/or interpreted or otherwise proc- 

55 essed. The processed medium may then be stored in a 
computer and/or machine memory. 
[0053] As shown in the first sequence of Figure 12 ; a 
time series signal may be digitized and smoothed by a 
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Hanning window to provide an accurate estimation of a 
fully voiced, a mixed voice, or an unvoiced segment. The 
complex spectrum for the windowed signal is obtained 
by means of an FFT that separates the digitized signals 
into frequency bins, with each bin identifying an ampli- 
tude across a small frequency range. 
[0054] In the second sequence, an averaging of the 
acoustic power in each frequency bin during unvoiced 
segments derives the background noise estimate. To 
prevent biased noise estimates, noise estimates may 
not occur when abnormal or unpredictable power fluc- 
tuations are detected. 

[0055] In the third sequence, the unmodified spec- 
trum is digitized, smoothed by a window, and trans- 
formed into the complex spectrum by an FFT. The un- 
modified spectrum exhibits portions containing noise- 
like segments and other portions exhibiting a regular 
harmonic structure. 

[0056] In the fourth sequence, a sound segment is fit- 
ted to separate lines to model the severity of the wind 
and continuous noise. To provide a more complete ex- 
planation, an unvoiced, fully voiced, and mixed voiced 
sample are shown. The frequency bins in each sample 
were converted into the power-spectral domain and log- 
arithmic domain to develop a wind buffet and continuous 
noise estimate. As more windows are processed, the 
average wind noise and continuous noise estimates are 
derived. 

[0057] To detect a wind buffet, a line is fitted to a se- 
lected portion of the signal in the SNR domain. Through 
a regression, best-fit lines model the severity of the wind 
noise in each illustration. A high correlation between one 
best-fit line and the low frequency spectrum may identify 
a wind buffet. Alternatively, a y-intercept that exceeds a 
predetermined threshold may also identify a wind buffet. 
To limit the masking of voice, the fitting of the line to a 
suspected wind buffet signal may be constrained by the 
rules described above. 

[0058] To overcome the effects of wind noise, the 
modeled noise may be dampened in the unmodified 
spectrum. In Figure 1 3, the dampening of the wind buf- 
fets and continuous noise from the unvoiced and mixed 
voiced sample are shown in the fifth sequence. An in- 
verse FFT that converts the signal power to the time do- 
main provides the reconstructed voice signal. 
[0059] From the foregoing descriptions it should be 
apparent that the above-described systems may condi- 
tion signals received from only one microphone or de- 
tector. It should also be apparent, that many combina- 
tions of systems may be used to identify and track wind 
buffets. Besides the fitting of a line to a suspected wind 
buffet, a system may (1 ) detect the peaks in the spectra 
having a SNR greater than a predetermined threshold; 
(2) identify the peaks having a width greater than a pre- 
determined threshold; (3) identify peaks that lack a har- 
monic relationships; (4) compare peaks with previous 
voiced spectra; and (5) compare signals detected from 
different microphones before differentiating the wind 



buffet segments, other noise like segments, and regular 
harmonic structures. One or more of the systems de- 
scribed above may also be used in alternative voice en- 
hancement logic. 
5 [0060] Other alternative voice enhancement systems 
include combinations of the structure and functions de- 
scribed above. These voice enhancement systems are 
formed from any combination of structure and function 
described above or illustrated within the attached fig- 
ures. The logic may be implemented in software or hard- 
ware. The term "logic" is intended to broadly encompass 
a hardware device or circuit, software, or a combination. 
The hardware may include a processor or a controller 
having volatile and/or non-volatile memory and may al- 
so include interfaces to peripheral devices through wire- 
less and/or hardwire mediums. 

[0061] The voice enhancement logic is easily adapt- 
able to any technology or devices. Some voice en- 
hancement systems or components interface or couple 
vehicles as shown in Figure 1 4, instruments that convert 
voice and other sounds into a form that may be trans- 
mitted to remote locations, such as landline and wireless 
telephones and audio equipment as shown in Figure 15, 
and other communication systems that may be suscep- 
tible to wind noise. 

[0062] The voice enhancement logic improves the 
perceptual quality of a processed voice. The logic may 
automatically learn and encode the shape and form of 
the noise associated with the movement of air in a real 
or a delayed time. By tracking selected attributes, the 
logic may eliminate or dampen wind noise using a lim- 
ited memory that temporarily or permanently stores se- 
lected attributes of the wind noise. The voice enhance- 
ment logic may also dampen a continuous noise and/or 
the squeaks, squawks, chirps, clicks, drips, pops, low 
frequency tones, or other sound artifacts that may be 
generated within some voice enhancement systems 
and may reconstruct voice when needed. 
[0063] While various embodiments of the invention 
have been described, it will be apparent to those of or- 
dinary skill in the art that many more embodiments and 
implementations are possible within the scope of the in- 
vention. Accordingly, the invention is not to be restricted 
except in light of the attached claims and their equiva- 
lents. 



Claims 

1 . A system for suppressing wind noise from a voiced 
or unvoiced signal, comprising: 

a noise detectorthat detects and models a wind 
buffet from an input signal; and 
a noise attenuator electrically connected to the 
noise detectorto substantially remove the wind 
buffet from the input signal. 
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2. The system for suppressing wind noise of claim 1 
where the noise detector models a line to a portion 
of the input signal. 

3. The system of claim 2 where the noise detector is 
configured to fit a line to a portion of the input signal 
in a SNR domain. 

4. The system of claim 1 where the noise detector is 
configured to model the wind buffet by calculating 
a signal offset. 

5. The system of claim 1 where the noise detector is 
configured to prevent the attributes of the modeled 
wind buffet from exceeding their respective average 
values. 

6. The system of claim 1 where the noise detector is 
configured to limit a wind buffet correction when a 
vowel or a harmonic like structure is detected. 

7. The system of claim 1 where the noise detector is 
configured to derive an average wind buffet model, 
and the average wind buffet model is not updated 
when a voiced or a mixed voice signal is detected. 

8. The system of claim 1 where the noise detector is 
configured to derive an average wind buffet model 
that is derived by a weighted average of other mod- 
eled signals analyzed earlier in time. 

9. The system of claim 1 where the noise attenuator 
is configured to substantially remove the wind buffet 
and a continuous noise from the input signal. 

10. The system of claim 1 further comprising a residual 
attenuator electrically coupled to the noise detector 
and the noise attenuatorto dampen signal power in 
a low frequency range when a large increase in a 
signal power is detected in the low frequency range. 

11. The system of claim 1 further including an input de- 
vice electrically coupled to the noise detector, the 
input device configured to convert sound waves into 
analog signals. 

12. The system of claim 1 further including a pre- 
processing system coupled to the noise detector, 
the pre-processing system configured to pre-condi- 
tion the input signal before the wind noise detector 
processes it. 

13. The system of claim 12 where the pre-processing 
system comprises first and second microphones 
spaced apart and configured to exploit a lag time of 
a signal that may arrive at the different detectors. 

14. The system of claim 13 further comprising control 



logic that automatically selects a microphone and a 
channel that senses the least amount of noise in the 
input signal. 

5 15. The system of claim 13 further comprising a second 
noise detector coupled to the noise detector and the 
first microphone. 

16. A system for detecting wind noise from a voiced and 
10 unvoiced signal, comprising: 

a time frequency transform logic that converts 
a time varying input signal into the frequency 
domain; 

15 a background noise estimator coupled to the 

time frequency transform logic, the background 
noise estimator configured to measure the con- 
tinuous noise that occurs near a receiver; and 
a wind noise detector coupled to the back- 
20 ground noise estimator, the wind noise detector 

configured to automatically identify and model 
a noise associated with wind. 

17. The system of claim 16 further comprising a tran- 
25 sient detector configured to disable the background 

noise estimator when a transient signal is detected. 

18. The system of claim 16 where the wind noise de- 
tector is configured to derive a correlation between 

30 the line and a portion of the input signal. 

1 9. The system of claim 1 6 further comprising a signal 
discriminatorcoupledtothewind noise detector the 
signal discriminator configured to mark the voice 

35 and the noise segments of the input signal. 

20. The system of claim 1 6 further comprising a wind 
noise attenuator coupled to the wind noise detector, 
the wind noise attenuator configured to reduce the 

40 noise associated with the wind that is sensed by the 
receiver. 

21 . The system of claim 1 6 where the noise attenuator 
is configured to substantially remove the noise as- 

45 sociated with the wind from the input signal. 

22. The system of claim 16 further comprising a resid- 
ual attenuator coupled to the background noise es- 
timator operable to dampen signal power in a low 

50 frequency range when a large increase in signal 
power is detected in the low frequency range. 

23. A system for suppressing wind noise from a voiced 
or unvoiced signal, comprising: 

55 

a time frequency transform logic that converts 
a time varying input signal into the frequency 
domain; 
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a background noise estimator coupled to the 
time frequency transform logic, the background 
noise estimator configured to measure the con- 
tinuous noise that occurs near a receiver; 
a wind noise detector coupled to the back- 
ground noise estimator, the wind detector con- 
figuredtofit a line to a portion of an inputsignal; 
and 

a wind attenuator coupled to the wind noise de- 
tector means; the wind attenuator being config- 
ured to remove a noise associated with wind 
that is sensed by the receiver. 

24. A method of removing a wind buffet from an input 
signal comprising: 

converting a time varying signal to a complex 
spectrum; 

estimating a background noise; 
detecting a wind buffet when a high correlation 
exists between a line and a portion of an input 
signal; and 

dampening the wind buffet from the input sig- 
nal. 

25. The method of claim 24 where the act of estimating 
the background noise comprises estimating the 
background noise when a transient is not detected. 

26. The method of claim 24 where the act of removing 
the wind buffet signal comprises substantially re- 
moving the wind buffet from the input signal. 

27. A method of removing a wind buffet from an input 
signal comprising: 

converting a time varying signal to a complex 
spectrum; 

estimating a background noise; 
detecting a wind buffet when a high correlation 
exists between a line and a portion of an input 
signal; and 

removing the wind buffet from the input signal. 

28. A signal-bearing medium having software that con- 
trols a detection of a noise associated with a wind 
comprising: 

a detector that converts sound waves into elec- 
trical signals; 

a spectral conversion logic that converts the 
electrical signals from a first domain to a sec- 
ond domain; and 

a signal analysis logic that models a portion of 
the sound waves that are associated with the 
wind. 

29. The signal-bearing medium of claim 28 further com- 



prising logic that derives a portion of a voiced signal 
masked by the noise. 

30. Thesignal-bearing medium of claim 28 further com- 
5 prising logic that attenuates portion of the sound 

waves. 

31. Thesignal-bearing medium of claim 28 further com- 
prising attenuator logic operable to limit a power in 

10 a low frequency range. 

32. Thesignal-bearing medium of claim 28 further com- 
prising noise estimation logic that measures a con- 
tinuous or ambient noise sensed by the detector. 

15 

33. Thesignal-bearing medium of claim 32 further com- 
prising transient logic that disables the estimation 
logic when an increase in power is detected. 

20 34. The signal-bearing medium of claim 28 where the 
signal analysis logic is coupled to an audio system. 

35. The signal-bearing medium of claim 28 where the 
signal analysis logic models only the sound waves 
25 that are associated with the wind. 
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